Adaptive exploration of computer experiment parameter spaces

نویسندگان

  • Robert B. Gramacy
  • Herbert K. H. Lee
  • William G. Macready
چکیده

Many complex phenomena are difficult to investigate directly through controlled experiments. Instead, computer simulation is becoming a commonplace alternative to providing insight into such phenomena. The drive towards higher fidelity simulation continues to tax the fastest of computers, even in highly distributed computing environments. Computational fluid dynamics simulations in which fluid flow phenomena are modeled are an excellent example—fluid flows over complex surfaces may be modeled accurately but only at the cost of supercomputer resources. In this article, we discuss the problem of fitting a response surface for a computer model when we also have the ability to design the experiment adaptively, updating the experiment as we learn about the model– a task to which we feel the Bayesian approach is particularly well-suited. Much of what is presented here follows our work in Gramacy et al. (2004). Without an analytic representation of the mapping between inputs and outputs, simulations must be run for many different input configurations in order to build up an understanding of its possible outcomes. Computational expense of the simulation and/or high dimensional inputs often prohibit the naive approach of running the experiment over a dense grid of possible inputs. However, computationally inexpensive surrogate models can often provide accurate approximations to the simulation, especially in regions of the input space where the response is easily predicted. For example, consider a model for the computational fluid dynamics of flight conditions for a proposed reusable NASA launch vehicle called the Langley Glide-Back Booster. The simulations involve the integration of the inviscid Euler equations over a mesh of 1.4 million cells. Each run of the Euler solver for a given set of parameters takes on the order of 5-20 hours on a high end workstation. There are three input parameters (side slip angle, Mach number, angle of attack). Six outputs are monitored (lift, drag, pitch, side-force, yaw, roll). The left panel of Figure 1 shows lift as a function of speed and angle of attack. Of note is the large ridge at Mach 1, where the flight abruptly transitions from subsonic to supersonic. While most of the output space is rather smooth, the ridge is clearly not. Thus there is interest in being able to automatically explore this surface, learning about the ridge and spending relatively more effort there than in the smooth regions. The above experiment is an example of a situation where surrogate models combined with active learning techniques could direct future sampling, dramatically reducing the size of the final experimental design, saving thousands of hours of computing time. Sampling can be focused on input configurations where the surrogate model is least sure of its predicted response, either because the output response is changing significantly or because there are relatively few nearby data points already examined. The traditional surrogate model used to approximate outputs to computer experiments is the Gaussian process (GP). GPs are conceptually straightforward, easily accommodate prior knowledge in the form of covariance functions, and return a confidence around predictions. In spite of its simplicity, there are three important disadvantages to standard GPs in our setting. Firstly, inference on the GP scales poorly with the number of data points, typically requiring computing time that grows with the cube of the sample size. Secondly, GP models are usually stationary in that the same covariance structure is used throughout the entire input space. In the applications we have in mind, where subsonic flow is quite different than supersonic flow, this limitation is unacceptable. Thirdly, the error (standard deviation) associated with a predicted response under a GP model does not directly depend on any of the previously observed output responses. All of these shortcomings may be addressed by partitioning the input space into regions, and fitting separate GPs within each region. Partitioning allows for modeling of non-stationary behavior, and can ameliorate some of the computational demands by fitting models to less data. Finally, a fully Bayesian

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

APPLICATION OF ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM FOR THE ASSESSMENT OF DAMAGED ZONE AROUND UNDERGROUND SPACES

The development of an excavation damaged zone (EDZ) around an underground excavation can change the physical, mechanical and hydraulic behaviors of the rock mass near an underground space. This might result in endangering safety, achievement of costs and excavation planed. This paper presents an approach to build a prediction model for the assessment of EDZ, based upon rock mass characteristics...

متن کامل

An Adaptive Approach to Increase Accuracy of Forward Algorithm for Solving Evaluation Problems on Unstable Statistical Data Set

Nowadays, Hidden Markov models are extensively utilized for modeling stochastic processes. These models help researchers establish and implement the desired theoretical foundations using Markov algorithms such as Forward one. however, Using Stability hypothesis and the mean statistic for determining the values of Markov functions on unstable statistical data set has led to a significant reducti...

متن کامل

The Divergent Interface: Supporting Creative Exploration of Parameter Spaces

This paper outlines a theoretical framework for creative technology based on two contrasting processes: divergent exploration and convergent optimisation. We claim that these two cases require different gesture-to-parameter mapping properties. We present results from a user experiment that motivates this theory. The experiment was conducted using a publicly available iPad app: “Sonic Zoom”. Par...

متن کامل

Techniques to Improve Exploration Efficiency of Parallel Self Adaptive Genetic Algorithms by Dispensing Synchronization

Exploration efficiency of GAs depends on parameter values such as mutation rate and crossover rate. To save labor of manually adjusting these values, GAs which automatically adjust parameter values(Adaptive GAs) have been proposed. We have proposed Self Adaptive Island GA(SAIGA), which does not require adjusting parameter values, and has search performance comparable to that of SGA with manuall...

متن کامل

Self Adaptive Artificial Bee Colony

Artificial Bee Colony (ABC) optimization algorithm is a swarm intelligence based nature inspired algorithm, which has been proved a competitive algorithm with some popular natureinspired algorithms. It is found that ABC is more efficient in exploration as compare to exploitation. With a motivation to balance exploration and exploitation capabilities of ABC, this paper presents an adaptive versi...

متن کامل

Adaptive scanning - a proposal how to scan theoretical predictions over a multi-dimensional parameter space efficiently

A method is presented to exploit adaptive integration algorithms using importance sampling, like VEGAS, for the task of scanning theoretical predictions depending on a multi-dimensional parameter space. Usually, a parameter scan is performed with emphasis on certain features of a theoretical prediction. Adaptive integration algorithms are wellsuited to perform this task very efficiently. Predic...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005